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MULTIDIMENSIONAL HASHED TREE BASED URL MATCHING ENGINE 

USING PROGRESSIVE HASHING 



BACKGROUND OF THE INVENTION 



1. Technical Field 



The present invention is directed to a mechanism for 
parsing and matching a uniform resource locator to a rule 

10 or resource. More specifically, the present invention is 
directed to a mechanism for progressively generating a 
hash of portions of a uniform resource locator and then 
identifying the hash of the portions within a 
multidimensional hash table to thereby match the uniform 

15 resource locator to a rule or resource. 



2. Description of Related Art: 



Matching of a uniform resource locator (URL) to 
20 rules or resources is a fundamental operation performed 
by server-side components that process URLs. Common URL 
rule matching applications include the Servlet/Java 
Server Page (JSP) engine, URL Authenticators, and the 
like . 

25 Typically, to match a URL to a resource, the URL is 

decomposed into constituent parts including a scheme 
portion, a web server name portions, a path portion, and 
a type portion. For example, the URL 

http://www.ibm.com/pagel.html includes a scheme portion 
30 that is u http" , the server name portions would be "com", 
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«ibm" and "www" , the path portion would be "pagel" and 
the type portion would be "html" . A decomposition tree 
structure is used to identify the files associated with a 
URL based on traversing the tree structure using the 
various components of the URL. Each component of the URL 
is hashed and its hash code appended to a prior component 
of the URL to obtain an accumulated URL hash code. Once 
the final hash code is determined by accumulating the 
hash codes for the components of the URL, a hash table 
lookup is performed to identify the files or resources 
associated with the URL. An example of such a system as 
described above is provided in U.S. Patent Application 
Publication No. 2002/0133570 which is hereby incorporated 

by reference. 

It would be beneficial to have an apparatus and 
method for matching a URL to rules and/or resources that 
provides increased performance over these known 
techniques . 
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SUMMARY OF THE INVENTION 



The present invention provides a mechanism by which 
URLs are progressively hashed character by character and 
5 clauses of the URL are used to traverse a tree data 
structure for matching of the URL to resources/rules. 
The hash code for a single character is appended to a 
prior hash code for a preceding character in the URL 
portion. At the time that the entire portion of the URL 

10 is hashed, as determined based on the presence of a 

delimiter character, the particular node in a tree data 
structure associated with the resulting hash code is 
identifiable within a hash table of a current node of the 
tree data structure. Each node in the tree data 

15 structure includes a multidimensional hash table for a 
portion of a URL. 

The multidimensional hash table is established and 
grown in a manner that ensures there are no hash 
collisions at each node. Each portion of the URL is 

2 0 parsed in this manner and the tree data structure is 

traversed as each portion is processed until the entire 
URL is parsed at which time the resulting rules/resources 
may be identified from the leaf nodes of the tree data 
structure . 

2 5 These and other features and advantages of the 

present invention will be described in, or will become 
apparent to those of ordinary skill in the art in view 
of, the following detailed description of the preferred 
embodiments . 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The novel features believed characteristic of the 
invention are set forth in the appended claims. The 
invention itself, however, as well as a preferred mode of 
use, further objectives and advantages thereof, will best 
be understood by reference to the following detailed 
description of an illustrative embodiment when read in 
conjunction with the accompanying drawings, wherein: 

Figure 1 is an exemplary block diagram of a 
distributed data processing system in which the present 
invention may be implemented; 

Figure 2 is an exemplary block diagram of a server 
computing device in which the present invention may be 
implemented; 

Figure 3 is an exemplary block diagram of a client 
computing device from which a URL request may be received 
by the server computing device; 

Figure 4 is an exemplary block diagram illustrating 
progressive hashing of portions of a URL in accordance 
with an exemplary embodiment of the present invention; 

Figure 5 is an exemplary block diagram illustrating 
traversal of a tree data structure in accordance with the 

present invention; and 

Figure 6 is a flowchart outlining an exemplary 

operation of the present invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention provides a mechanism for 
matching uniform resource locators (URLs) to resources 
5 and/or rules. The present invention is especially well 
suited for use with server computing devices in a 
distributed data processing system. For example, a 
client computing device may submit a URL to a server 
computing device which then matches that URL to a 

10 resource/rule to perform a subsequent function. Thus, in 
order to provide a context for the following description 
of the preferred embodiments of the present invention, a 
brief description of a distributed data processing system 
will be provided with reference to Figures 1-3. 

15 With reference now to the figures, Figure 1 depicts a 

pictorial representation of a network of data processing 
systems in which the present invention may be implemented. 
Network data processing system 100 is a network of 
computers in which the present invention may be 

20 implemented. k Network data processing system 100 contains 
a network 102, which is the medium used to provide 
communications links between various devices and computers 
connected together within network data processing system 
100. Network 102 may include connections, such as wire, 

25 wireless communication links, or fiber optic cables. 

In the depicted example, server 104 is connected to 
network 102 along with storage unit 106. In addition, 
clients 108, 110, and 112 are connected to network 102. 
These clients 108, 110, and 112 may be, for example, 

3 0 personal computers or network computers. In the depicted 
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example, server 104 provides data, such as boot files, 
operating system images, and applications to clients 108- 
112. Clients 108, 110, and 112 are clients to server 104. 
Network data processing system 100 may include additional 
5 servers, clients, and other devices not shown. In the 
depicted example, network data processing system 100 is 
the Internet with network 102 representing a worldwide 
collection of networks and gateways that use the 
Transmission Control Protocol/Internet Protocol (TCP/IP) 

10 suite of protocols to communicate with one another. At 

the heart of the Internet is a backbone of high-speed data 
communication lines between major nodes or host computers, 
consisting of thousands of commercial, government, 
educational and other computer systems that route data and 

15 messages. Of course, network data processing system 100 
also may be implemented as a number of different types of 
networks, such as for example, an intranet, a local area 
network (LAN) , or a wide area network (WAN) . Figure 1 is 
intended as an example, and not as an architectural 

2 0 limitation for the present invention. 

Referring to Figure 2, a block diagram of a data 
processing system that may be implemented as a server, 
such as server 104 in Figure 1, is depicted in accordance 
with a preferred embodiment of the present invention. 
25 Data processing system 200 may be a symmetric 

multiprocessor (SMP) system including a plurality of 
processors 202 and 204 connected to system bus 206. 
Alternatively, a single processor system may be employed. 
Also connected to system bus 206 is memory 

3 0 controller/cache 208, which provides an interface to local 
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memory 209. I/O bus bridge 210 is connected to system bus 
206 and provides an interface to I/O bus 212. Memory 
controller/cache 208 and I/O bus bridge 210 may be 
integrated as depicted. 
5 Peripheral component interconnect (PCI) bus bridge 

214 connected to I/O bus 212 provides an interface to PCI 
local bus 216. A number of modems may be connected to PCI 
local bus 216. Typical PCI bus implementations will 
support four PCI expansion slots or add- in connectors. 

10 Communications links to clients 108-112 in Figure 1 may be 
provided through modem 218 and network adapter 22 0 
connected to PCI local bus 216 through add- in boards. 

Additional PCI bus bridges 222 and 224 provide 
interfaces for additional PCI local buses 226 and 228, 

15 from which additional modems or network adapters may be 
supported. In this manner, data processing system 200 
allows connections to multiple network computers. A 
memory-mapped graphics adapter 230 and hard disk 232 may 
also be connected to I/O bus 212 as depicted, either 

2 0 directly or indirectly. 

Those of ordinary skill in the art will appreciate 
that the hardware depicted in Figure 2 may vary. For 
example, other peripheral devices, such as optical disk 
drives and the like, also may be used in addition to or in 

25 place of the hardware depicted. The depicted example is 
not meant to imply architectural limitations with respect 
to the present invention. 

The data processing system depicted in Figure 2 may 
be, for example, an IBM eServer pSeries system, a product 

30 of International Business Machines Corporation in Armonk, 
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New York, running the Advanced Interactive Executive 
(AIX) operating system or LINUX operating system. 

With reference now to Figure 3, a block diagram 
illustrating a data processing system is depicted in which 
the present invention may be implemented. Data processing 
system 300 is an example of a client computer. Data 
processing system 300 employs a peripheral component 
interconnect (PCI) local bus architecture. Although the 
depicted example employs a PCI bus, other bus 
architectures such as Accelerated Graphics Port (AGP) and 
Industry Standard Architecture (ISA) may be used. 
Processor 302 and main memory 304 are connected to PCI 
local bus 306 through PCI bridge 308. PCI bridge 308 also 
may include an integrated memory controller and cache 
memory for processor 302. Additional connections to PCI 
local bus 306 may be made through direct component 
interconnection or through add-in boards. In the depicted 
example, local area network (LAN) adapter 310, SCSI host 
bus adapter 312, and expansion bus interface 314 are 
connected to PCI local bus 306 by direct component 
connection. In contrast, audio adapter 316, graphics 
adapter 318, and audio/video adapter 319 are connected to 
PCI local bus 306 by add-in boards inserted into expansion 
slots. Expansion bus interface 314 provides a connection 
for a keyboard and mouse adapter 320, modem 322, and 
additional memory 324. Small computer system interface 
(SCSI) host bus adapter 312 provides a connection for hard 
disk drive 326, tape drive 328, and CD-ROM drive 330. 
Typical PCI local bus implementations will support three 
or four PCI expansion slots or add- in connectors. 
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An operating system runs on processor 302 and is used 
to coordinate and provide control of various components 
within data processing system 300 in Figure 3. The 
operating system may be a commercially available operating 
5 system, such as Windows XP, which is available from 
Microsoft Corporation. An object oriented programming 
system such as Java may run in conjunction with the 
operating system and provide calls to the operating system 
from Java programs or applications executing on data 

10 processing system 300. "Java" is a trademark of Sun 

Microsystems, Inc. Instructions for the operating system, 
the object-oriented programming system, and applications 
or programs are located on storage devices, such as hard 
disk drive 326, and may be loaded into main memory 304 for 

15 execution by processor 302* 

Those of ordinary skill in the art will appreciate 
that the hardware in Figure 3 may vary depending on the 
implementation. Other internal hardware or peripheral 
devices, such as flash read-only memory (ROM) , equivalent 

20 nonvolatile memory, or optical disk drives and the like, 
may be used in addition to or in place of the hardware 
depicted in Figure 3. Also, the processes of the present 
invention may be applied to a multiprocessor data 
processing system. 

25 As another example, data processing system 300 may 

be a stand-alone system configured to be bootable without 
relying on some type of network communication interfaces 
As a further example, data processing system 300 may be a 
personal digital assistant (PDA) device, which is 

3 0 configured with ROM and/or flash ROM in order to provide 



EXPRESS MAIL NO. EV333422384US 

10 

Docket No. RSW920030104US1 



non-volatile memory for storing operating system files 
and/or user-generated data. 

The depicted example in Figure 3 and above-described 
examples are not meant to imply architectural 
5 limitations. For example, data processing system 300 

also may be a notebook computer or hand held computer in 
addition to taking the form of a PDA. Data processing 
system 300 also may be a kiosk or a Web appliance. 

As mentioned above, the present invention provides a 

10 mechanism for matching URLs to resources/rules. The 

present invention uses a progressive hash of portions of 
the URL to determine how to traverse a tree data 
structure to arrive at leaf nodes representing the 
resources/rules associated with the URL. 

15 The tree data structure used by the present 

invention is a modification of known Dictionary or 
Digital Tree data structures that is modified for use 
with URLs. In the known Dictionary or Digital Tree data 
structure, an n-ary tree representation of a string is 

20 provided in which every node of the tree data structure 
represents a single character. Each node of the tree 
data structure has an array of subtrees indexed by the 
ASCII value of the next character. Thus, as a string is 
traversed one character at a time, the tree is traversed 

2 5 using the ASCII value of that character. Upon reaching 
the end of the string, the terminal node is examined for 
"targets" and those targets are returned as a successful 
match. If at any stage the tree traversal fails, i.e. 
there is no subtree corresponding to the ASCII value of 
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the next character, the result is that there are no 
matching targets for the given string. 

The present invention makes use of a modified form 
of the Dictionary or Digital Tree data structure that is 
modified for use with URLs. With the tree data structure 
of the present invention, each tree node represents a 
portion of the URL, i.e. a clause. A clause or portion 
of the URL is defined as a segment of the URL that is 
delimited by a particular character, e.g., "/"• Thus, 
for example, with the URL /www. ibm. com/ welcome/pagel .html 
the clauses or portions are www.ibm.com, welcome, and 
pagel . html . 

Each node of the tree data structure includes a 
multidimensional hash table for identifying subtrees of 
the tree data structure. In a preferred embodiment, the 
multidimensional hash table is a three dimensional table 
in which each entry of the table has an X, Y and Z 
coordinate in the table. For example, the target object 
in the multidimensional hash table is identified using 
the following equation: 

Th o { (h%X) , (h%Y) , (h%Z) } (D 

where T h is the target object entry, e.g., the 
subtree root node, h is the hash code of the clause or 
portion of the URL, X, Y and Z are the dimensions of the 
three dimensional hash table, and % is the modulo 
operator. The three dimensional hash table is grown such 
that there are no collisions between hashed values. That 
, is, when a put() method is called to place an entry into 
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the hash table for a node of the tree data structure, the 
multidimensional hash table is restructured with new 
dimensions that eliminate any possibility of a hash 
collision. 

5 For example, if the dimensions of the 

multidimensional hash table are x=l, y=2, z=3 and a new 
entry is to be placed in the hash table, the coordinates 
for where it will be placed in the table are first 
computed using equation (1) above. If the resulting space 

10 is empty, the new entry is added without any change to 
the table dimensions. If there was already a subtree at 
the computed location, the hash table is grown by 
iteratively adjusting the dimensions, such that all the 
elements of the table are in unique locations in the 

15 table as prescribed by equation (1) . The resulting table 
may end up, for example, with dimensions x=4, y=2 , z=3 . 
The next addition to the hash table may result in the 
dimensions being x=4 , y=5, z=3 followed by the next 
addition resulting in dimensions x=4, y=5, z=6, and so 

20 on. In this way, hash collisions are avoided. 

Hash collisions often decrease performance in hash 
table implementations. In the above example, where x=l, 
y=2, z=3, a collision would occur if one hash value (h in 
the above formula) would be equal to 6 for one word and 

25 12 for another word. This is because h%x=0, h%y=0, and 
h%z=0 for both h=6 and h=12. When growing the table, 
setting x=4, y=2, x=3 , for h=6, h%x=2 , h%y=0, and h%z=0. 
However, for h=12, h%x=0, h%y=0, and h%z=0. This puts 
the two items in different locations in the table. 



EXPRESS MAIL NO. EV333422384US 

14 

Docket No. RSW920030104US1 



handling hash collisions and performing rehashes to 
correct for the hash collisions. In addition, the need 
to traverse a list of possible matches, such as in the 
case with traditional bucket based hashing strategies, 
5 for the resolution of a given URL on a get() call is 
eliminated. 

Thus, given a hash code for a clause or portion of a 
URL, a target subtree of a tree data structure may be 
found by inserting the hash code as h and the dimensions 

10 of the multidimensional hash table, which in this case 
are X, Y and Z, in equation (1) . The entry in the 
multidimensional hash table corresponding to the 
resulting target value T h is then retrieved and used to 
identify the subtree to which the traversal of the tree 

15 data structure should proceed. 

The process is then repeated for each subsequent 
clause or portion of the URL until the entire URL is 
processed or there is no matching subtree for a 
calculated T h value, i.e. the URL does not match to any 

20 resources/rules. Once the URL is processed completely in 
this manner, the terminal node is searched for any target 
resources/rules and these resources/rules are returned as 
matches for the URL. Each node on the tree has the 
ability to store a reference to an object or identifier 

25 of a rule or resource. If there is not an identifier set 
in this target node, then the resource is assumed not to 
exist. When composing the tree, the target identifiers 
are set in the last node, i.e., the leaf nodes, of the 
URL added. 
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In order to obtain the hash code for a clause or 
portion of a URL, the present invention makes use of a 
progressive hashing mechanism that parses and hashes each 
character in the URL while identifying clauses or 
5 portions of the URL and traversing the tree data 

structure in virtually a parallel operation. With the 
present invention, when a URL is to be matched to a 
resource/rule, the URL is received and a hash code for a 
current clause/portion of the URL is initialized to a 

10 starting value. The next character in the URL is then 

identified and a hash of the character is generated based 
on a hashing algorithm. Hashing algorithms are generally 
known in the art and any known or later developed hashing 
algorithm may be used without departing from the spirit 

15 and scope of the present invention. 

Once the hash for the character is generated, the 
hash code is compared to at least one hash code 
corresponding to a delimiter, or "special", character. 
While the preferred embodiments are described as 

20 utilizing the "/" character as a delimiting character, 
the present invention is not limited to such. Rather, 
any character may be designated as a delimiting character 
without departing from the spirit and scope of the 
present invention. Moreover, more than one "special" 

2 5 character may be defined in the mechanism of the present 
invention with functions or nodes of the tree data 
structure being established for these "special" 
characters. A "special" character is one that is set 
apart from other characters as having a designated 

30 functionality associated with it. In the exemplary 
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embodiments, the "/" character is special in that it is 
designated as having the function of separating clauses 
or portions of a URL. 

If the hash of the character in the URL results in 
5 the hash code not matching a hash code of a special 

character, then the hash code for the character is added 
to the hash code for the current clause or portion of the 
URL and the process is repeated for the next character in 
the URL. If the hash of the character in the URL results 

10 in the hash code matching a hash code for a special 
character, which in the preferred embodiments is a 
delimiting character, then it is determined that the end 
of the clause or portion of the URL has been encountered. 
When the end of the clause or portion of the URL is 

15 identified, the hash code for the clause or portion has 
already been compiled. As a result, the present 
invention may use the compiled hash code for the current 
clause or portion of the URL to calculate T h and retrieve 
the identity of the subtree associated with the clause or 

2 0 portion of the URL that was progressively hashed. The 

traversal of the tree data structure may be performed by 
designating the current node to be the root node of the 
subtree identified by the target value T h in the 
multidimensional hash table of the previous node in the 

25 tree data structure. This process may be repeated for 
each subsequent clause/portion of the URL until the 
entire URL is processed or until a subtree corresponding 
to a clause in the URL cannot be identified, i.e. there 
are no resources/rules associated with the URL. 
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Once the URL is completely processed in the above 
manner, the terminal node, i.e. the current node for the 
last clause/portion of the URL, is searched for targets. 
That is, the resources/rules associated with the terminal 
5 node in the tree data structure are identified and 

returned as matched resources/rules associated with the 
URL. The server may then perform appropriate processing 
based on the identified resources/rules. 

Because the present invention makes use of 

10 progressive hashing and incremental traversal of the tree 
data structure with each hashed clause/portion of the 
URL, the traversal of the tree data structure and the 
progressive hashing may be performed virtually in 
parallel. This greatly increases the speed at which the 

15 mechanisms of the present invention match URLs to 

resources/rules when compared to existing methods of 
performing URL matching. 

Figure 4 is an exemplary block diagram illustrating 
progressive hashing of portions of a URL in accordance 

20 with an exemplary embodiment of the present invention. 
As shown in Figure 4, a portion of a URL " /foo/bar" is 
illustrated. The present invention hashes each character 
of the URL individually and then determines whether to 
added the hash code for the character to a hash code for 

25 a clause or portion of the URL, or whether the hash code 
for the character indicates the character to be a 
11 special " character . 

For example, as shown in Figure 4, during parsing of 
the URL, the first character "/" is evaluated to 

3 0 correspond to a special delimiter character and thus, the 
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present invention would perform a lookup of any hash code 
for a prior portion of the URL in a multidimensional hash 
table of the current node of the tree data structure. 
Since a special character was encountered, the 
5 present invention would reinitialize the hash code for a 
current URL clause to a starting value and then continue 
to parse the URL to identify the hash code of the next 
clause in the URL. The continued parsing of the URL 
results in the character " f" being hashed to a hash code 

10 of H2 . Since H2 does not correspond to a special 

character, H2 is added to the hash code for Hf 00 - This 
process is then repeated for each of characters "o" in 
"foo" resulting in hash codes H3 and H4 which are both 
added to the hash code H foo - When the next character "/" 

15 is evaluated, it is realized, before hashing the "/"/that 
this is the special delimiter character. Thus, the hash 
code Hfoo is then used to cal culate Th using the 
dimensions for the multidimensional hash table of the 
current node of the tree data structure. The Th value is 

20 then looked up in the multidimensional hash table to 

thereby identify the subtree to which traversal of the 
tree data structure should proceed. 

The root node for this subtree is then set as the 
current node and the operation continues with the parsing 

25 and hashing of the clause "bar" . The hash code H ba r is 

generated as the combination of the hash codes H6, H7 and 
H8 for the characters of this clause in the URL and is 
then used to calculate Th, based on the dimensions for 
the multidimensional hash table for the current node, 

3 0 i.e. the "foo" node, for this clause. The resulting 
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value for T h is then used to identify the subtree 
corresponding to this clause. Since this is the last 
clause in the URL, the node corresponding to "bar- would 
be searched for targets and the targets returned as 
matches for the URL »/foo/bar«. 

Figure 5 is an exemplary block diagram illustrating 
traversal of a tree data structure in accordance with the 
present invention. As shown in Figure 5, the tree data 
structure includes a plurality of nodes (represented by 
cubes denoting the multidimensional hash tables 
associated with the nodes) . The transition from one node 
to the next is illustrated by a line with an associated 
URL clause or portion. For example, as shown in Figure 
5, entries in the multidimensional hash table of node 510 
identify nodes 520, 530 and 540. A hash code 
corresponding to the URL clause -foo- results in the 
transition from node 510 to node 520 being the current 
node. A subsequent hash code corresponding to the URL 
clause "bar" results in the current node being reassigned 

to node 550, and so on. 

In one preferred embodiment of the present 
invention, in addition to determining the next clause in 
the URL, getting its hash code by progressive hashing of 
characters, and getting the target subtree for the clause 
from the hash table of the current node, the present 
invention checks the current node for any wildcard 
matches. For example, if there are any child nodes of 
the current node that are associated with a wildcard 
character, which is a form of special character, such as 
, ■.*», then the targets associated with this wildcard node 
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are added to a list of matching resources/rules for the 
URL. 

Thus, as shown in Figure 5, the targets associated 
with node 560 are added to the list of matched 
resources/rules. In addition, since "bar" is the last 
clause of the URL, the resources/rules associated with 
the node 570 are also included in the list of matched 

resources/rules . 

As an example of how to traverse the tree data 
structure shown in Figure 5, the following description is 
provided with reference to the exemplary URL 
"/foo/bar/defaultPage." A first step in retrieving 
matched resources/rules for - /f oo/bar/def aultPage 
involves initializing the current node to be the »/" node 
510. The first clause is then progressively hashed as 
previously described above. The subtree for the clause 
"foo- is then requested from the current node and the 
current node is checked for any child wildcard nodes. 
Since no wildcard nodes are found, the list of matched 

results is not modified. 

The subtree for "foo" is found and the current node 
is reset to the root node of the subtree for "foo", i.e. 
node 520. The next clause "bar" is parsed and 
progressively hashed. The subtree for "bar" is requested 
from the current node, i.e. the "foo" node. In addition, 
the "foo" node is checked for any child wildcard nodes. 
Since the "foo" node has a wildcard child node 560, the 
targets associated with the wildcard child node 560 are 
added to the list of matched resources/rules. 
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The subtree for "bar" is found in the hash table for 
the current node and the current node is reset to the 
root node of the subtree for "bar", i.e. node 550. The 
next clause "def aultPage" in the URL is parsed and 
progressively hashed. The subtree for "def aultPage" is 
requested from the current node, i.e. node 550, and the 
current node is checked for any child wildcard nodes. 
Since the node 550 has a child wildcard node 570, the 
targets associated with node 570 are added to the list of 
matched resources/rules. 

However, no subtree is found for the clause 
"def aultPage." The extension for "def aultPage" is 
determined to be null and thus, extension matching is 
ignored. An extension is an identification of a "type- 
element of a URL as discussed previously. Rules or 
resources provided to the present invention may be keyed 
to a particular URL "type" element, such as "html." 
Thus, for example, the rules associated with the present 
invention may include a rule corresponding to the pattern 
*.html. in such a case, in addition to identifying all 
the path matches for the input URL using the present 
invention, the extension of the input URL may also be 
examined to see if it matches any predefined extension 
patterns. If it does, the corresponding target may also 
be included in the list of targets matched for the input 
URL. 

For example, if the input URL were 
/foo/bar/defaultPage.html, and there was a predefined 
target/rule Tl defined for *.html, then, in addition to 
returning the rules associated with /foo/* and /foo/bar/* 
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Accordingly, blocks of the flowchart illustration 
support combinations of means for performing the 
specified functions, combinations of steps for performing 
the specified functions and program instruction means for 
5 performing the specified functions. It will also be 

understood that each block of the flowchart illustration, 
and combinations of blocks in the flowchart illustration, 
can be implemented by special purpose hardware-based 
computer systems which perform the specified functions or 

10 steps, or by combinations of special purpose hardware and 
computer instructions. 

As shown in Figure 6, the operation starts by 
receiving a URL for which associated resources/rules are 
to be identified (step 610) . The hash code for the 

15 current portion of the URL is initialized to a starting 
state (step 620) . The next character in the URL is then 
hashed to obtain a hash code for that individual 
character (step 630) . A determination is made as to 
whether the hash code identifies a delimiter character, 

20 e.g., a "/" character (step 640). If not, the hash code 
for the individual character is then added to any 
previously compiled hash code for the URL portion if any 
(step 650) . The process then continues by returning to 
step 630 where the next character in the URL is hashed. 

25 If the hash code of the character indicates that the 

character is a delimiter character, such as "/" / then the 
compiled hash code for the URL portion is used to 
identify a corresponding subtree within the tree data 
structure by looking up the hash code in the 

30 multidimensional hash table associated with the current 
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node of the tree data structure (step 660) . The root 
node for the subtree is then set as the current node 
(step 670) and a determination is made as to whether the 
end of the URL has been encountered (step 680) . If not, 
5 the operation returns to step 620. Otherwise, if the end 
of the URL has been encountered, then the rules/resources 
associated with the URL are identified as the leaf nodes 
associated with the current node (step 690) . The 
operation then terminates. 

10 Thus, the present invention provides a mechanism 

that utilizes progressive hashing of characters in a URL 
to traverse a tree data structure. As each portion of 
the URL is progressively hashed, the resulting hash code 
for the URL portion is used to perform a lookup in a 

15 multidimensional hash table associated with the current 

node of the tree data structure. In this way, a fast and 
easily used URL matching mechanism is provided for 
matching URLs to resources and/or rules in server 
computing devices is provided. 

2 0 It is important to note that while the present 

invention has been described in the context of a fully 
functioning data processing system, those of ordinary 
skill in the art will appreciate that the processes of 
the present invention are capable of being distributed in 

2 5 the form of a computer readable medium of instructions 
and a variety of forms and that the present invention 
applies equally regardless of the particular type of 
signal bearing media actually used to carry out the 
distribution. Examples of computer readable media 

30 include recordable -type media, such as a floppy disk, a 
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hard disk drive, a RAM, CD-ROMs, DVD-ROMs, and 
transmission- type media, such as digital and analog 
communications links, wired or wireless communications 
links using transmission forms, such as, for example, 
radio frequency and light wave transmissions. The 
computer readable media may take the form of coded 
formats that are decoded for actual use in a particular 
data processing system. 

The description of the present invention has been 
presented for purposes of illustration and description, 
and is not intended to be exhaustive or limited to the 
invention in the form disclosed. Many modifications and 
variations will be apparent to those of ordinary skill in 
the art . The embodiment was chosen and described in 
order to best explain the principles of the invention, 
the practical application, and to enable others of 
ordinary skill in the art to understand the invention for 
various embodiments with various modifications as are 
suited to the particular use contemplated. 



